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ABSTRACT 

Riboswitches regulate gene expression via ligand 
binding to an aptamer domain which induces con- 
formational changes in a regulatory expression 
platform. By unfolding and refolding single add 
adenine riboswitch molecules in an optical trap, an 
integrated picture of the folding was developed 
and related to the regulatory mechanism. Force- 
extension curves (FECs) and constant-force folding 
trajectories measured on the aptamer alone 
revealed multiple partially-folded states, including 
several misfolded states not on the native folding 
pathway. All states were correlated to key structural 
components and interactions within hierarchical 
folding pathways. FECs of the full-length riboswitch 
revealed that the thermodynamically stable con- 
formation switches upon ligand binding from a 
structure repressing translation to one permitting 
it. Along with rapid equilibration of the two struc- 
tures in the absence of adenine, these results 
support a thermodynamically-controlled regulatory 
mechanism, in contrast with the kinetic control of 
the closely-related pbuE adenine riboswitch. 
Comparison of the folding of these riboswitches 
revealed many similarities arising from shared 
structural features but also essential differences 
related to their different regulatory mechanisms. 

INTRODUCTION 

Riboswitches are gene regulatory elements in mRNA 
which modulate gene activity by sensing the concentration 
of associated small metabolites (1-3). Found in a wide 



range of organisms, they are often bipartite in structure, 
consisting of an aptamer domain which senses the target 
ligand and an expression platform which modulates gene 
expression. Ligand binding stabilizes the aptamer domain 
through changes in secondary and/or tertiary interactions, 
typically switching the structure of the expression 
platform between two alternatives and thereby altering 
transcription, translation, splicing or other processes 
during expression (1-3). 

The purine riboswitches are among the smallest and 
structurally simplest riboswitches, yet collectively they 
still demonstrate much of the variety of mechanisms 
used by more complex riboswitches (4). For example, 
the xpt guanine riboswitch controls expression via tran- 
scription termination (5), the pbuE adenine riboswitch 
through transcription anti-termination (6) and the add 
adenine riboswitch through translation activation (7). 
Despite such different mechanisms, the aptamers of 
these riboswitches all have very similar structures: a 
'tuning-fork' architecture wherein the purine binding site 
is located at a specific residue in a pocket formed at the 
junction of three helices, two of which are hairpins inter- 
acting via kissing loops and aligned on top of the third 
helix (7,8). Comparative studies of these riboswitches thus 
provide an opportunity to investigate the molecular 
features involved in their different regulatory 
mechanisms (9). 

Previous studies have investigated various different 
features of the purine riboswitch aptamers, such as the 
ligand specificity (6,10) and its structural basis (10-12), 
the kinetics of the loop-loop interaction (13), the Mg 2+ - 
dependence of the folding (14), the folding energy land- 
scape (15), the structural changes upon ligand binding 
(11,16-18) and the ligand binding/dissociation rates and 
energies, for both the aptamer and the full-length 
riboswitch (9,12,19). The relation of aptamer folding to 
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regulatory mechanism has been less well studied, however, 
whether in purine riboswitches or more generally. The 
pbuE riboswitch has been shown, along with the FMN 
riboswitch, to function via kinetically-controlled folding 
of the aptamer in competition with ligand binding and 
transcription (9,13,15,20), while the folding kinetics of 
the preQ[ riboswitch aptamer suggest that transcription 
regulation is achieved instead by ligand-induced thermo- 
dynamic changes (21). The add riboswitch has been con- 
firmed to control translation in vitro and in vivo, with 
expression levels indicating that regulation does not 
require transcription-translation coupling (9). To date, 
however, little has been done to characterize the inter- 
action between an aptamer and its corresponding expres- 
sion platform, a key question for understanding and 
manipulating riboswitch function (22). 

We addressed these issues by using force spectroscopy 
to observe the folding and unfolding trajectories of indi- 
vidual add riboswitch molecules from Vibrio vulnificus 
held under tension in a high-resolution optical trap. 
Single molecule force spectroscopy provides a high- 
precision tool for probing structural transitions (23). 
Changes in the molecular extension can be measured 
with single-nucleotide (nt) resolution as the molecule 
unfolds and refolds under a controlled load, allowing 
the structural elements involved in each folding event to 
be determined. The force-dependence of reaction kinetics 
and equilibria also permit detailed characterization of the 
folding landscapes (15,24). We first built an integrated 
picture of folding in the aptamer alone, similar to 
previous force spectroscopy work on the closely-related 
pbuE adenine riboswitch aptamer (15). Since these 
riboswitches have very similar aptamer structures but 
work by completely different regulatory mechanisms — 
translation activation (add) rather than transcription 
anti-termination (pbuE) — we can begin to discern which 
aspects of the folding arise from shared structural features 
and which may be important for the different regulatory 
mechanisms. We then extended these measurements to the 
complete riboswitch, probing the interaction between 
aptamer and expression platform and thereby 
demonstrating that the regulation is likely controlled by 
the folding thermodynamics. 



MATERIALS AND METHODS 

RNA constructs were created by inserting the sequence for 
the add aptamer alone, expression platform alone or 
full-length riboswitch (Supplementary Table SI) into the 
pMLuc-1 plasmid. The resulting transcription template 
containing the riboswitch (or its constituent parts) 
flanked by linker regions on either side was amplified by 
PCR and transcribed in vitro. Two ssDNA handles (one 
complementary to the 842-nt linker on the 3'-end of the 
transcript and labelled with biotin, the other complemen- 
tary to the 1289-nt linker on the 5'-end of the transcript 
and labelled with digoxigenin) were polymerized from the 
same template using asymmetric PCR (25). The handles 
were annealed with the RNA transcript, then incubated 
with 600-nm and 820-nm diameter polystyrene beads 



labelled with avidin DN (Vector Labs) and anti- 
digoxigenin (Roche), respectively, to create dumbbells. 
Dumbbells were placed in measuring buffer (50 mM 
MOPS pH 7.5, 130mM KC1, 4mM MgCl 2 , 200U/ml 
Superase*In RNase inhibitor (Ambion) and oxygen 
scavenging system: 40U/mL glucose oxidase, 185U/mL 
catalase and 8.3mg/ml glucose) and inserted into a 
sample chamber on a clean microscope slide in the 
optical trap. 

Measurements were made using a custom-built, 
dual-beam optical trap similar to one described previously 
(26). Briefly, two laser beams at 1064nm with independent 
position and intensity control were used to pull on the han- 
dles of the RNA constructs and apply force to the 
riboswitch. Changes in construct length resulting from 
the applied force were determined by monitoring the de- 
flection of two separate detector beams at 633 nm that 
resulted from the motion of the beads, using position sen- 
sitive diodes. Constant force measurements were made 
using a passive force clamp, by pulling the bead in one 
trap out to a distance of ~170-180nm from the trap 
centre, where the force displacement curve had zero 
slope (26). Force was changed by adjusting the laser in- 
tensity using acousto-optic deflectors. Trap stiffness (0.3 
and 0.6pN/nm for force-extension curves (FECs)) was 
calibrated as described previously (27). 

Data were recorded by custom Labview software 
(National Instruments), sampled at 20 kHz and filtered 
online at 10 kHz with an 8-pole Bessel filter. Offline 
analysis was done using custom software in Igor Pro 
(Wavemetrics). Multiple FECs were aligned to correct 
for slow drift of a few nanometres over several hours. 
Constant force data were median filtered offline in a 
1-2.5 ms window, and states with different extensions 
were separated and identified using a modified threshold- 
ing algorithm (24). Additional details on all methods and 
materials are provided in the Supplementary Data. 

RESULTS 

Folding intermediates of the aptamer 

First the aptamer alone was measured, using FECs and 
trajectories at constant force to identify states in the 
folding pathway. The optical trapping assay is illustrated 
schematically in Figure 1A. Three FECs measured in the 
absence of adenine (Figure IB: black, blue and red lines) 
display typical features: the force, F, at first rises 
nonlinearly with extension due to the elasticity of the 
double-stranded (ds) handles (28,29), until characteristic 
'sawtooth' patterns occur when the length increases 
abruptly due to unfolding of different parts of the 
aptamer (29). Four states are seen with different contour 
lengths. The change in the contour length associated with 
each unfolding transition, AL C , was found by fitting the 
aggregated FEC data from hundreds of pulls on the same 
molecule (Figure IB: grey dots) to two extensible 
worm-like chain (WLC) models (30) in series, one for 
the ds handles, and a second for the unfolded ssRNA 
(31). The average values of AL C determined from more 
than 5200 FECs measured on four different molecules 
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Figure 1. Force spectroscopy of add aptamer alone. (A) RNA containing the riboswitch aptamer flanked by 2kb-long handle sequences was 
annealed to DNA strands complementary to the handles and attached to beads held in optical traps. (B) FECs in the absence of adenine reveal 
two intermediate states corresponding to the unfolding of everything except hairpins P2 and P3, followed by hairpin P2. Three FECs (black, blue, 
red) are plotted above the aggregated data from 700 FECs (grey dots). WLC fits are shown for the four states: F, fully folded (purple); U, 
fully-unfolded (yellow); P3-P3, folded (green) and P2P3, both P2 and P3 folded (cyan). (C) Adenine binding resulted in similar behaviour but 
higher unfolding forces. State P2P3 was observed less frequently if at all. (D) The extension as a function of time at different levels of constant force 
in the absence of adenine reveals five distinct states corresponding to the major structural features: fully-unfolded at the largest extension ('U'), then 
P3 folded ('P3'), then both P2 and P3 folded but no loop-loop interaction ('P2P3'), then the P2-P3 kissing loop complex (labelled 'P1_U\ PI 
unfolded) and finally fully-folded ('F'). Only short segments of the full records are displayed. (E) Histograms of the full extension records show 
force-dependent occupancies of the states. The P1_U state always has low occupancy. 



were: AL C = 39.6 ± 0.5 nm from the fully folded state 
(labelled 'F', purple line) to the fully unfolded state 
(labelled 'U', yellow line), AL C = 19 ± 1 nm from F to 
the shorter intermediate (labelled 'P2P3', cyan line), and 
AL C = 30.3 ± 0.5 nm from F to the longer intermediate 
(labelled 'P3', green line). All uncertainties are reported 
as the standard error. 

We related these states to the unfolding of distinct 
structural elements in the aptamer (Figure IB, inset) 
via the AL C values expected from these structures: 
AL C = N nt - L c nt — An b 'dh, where N nt is the number of nu- 
cleotides unfolded, L c nt = 0.59 nm is the contour length/nt 
for ssRNA (32), A« h is the number of RNA helices 
removed during unfolding and d h = 2.2 nm is the 
diameter of the A-form dsRNA helix (32). The AL C 
values expected for different structures are listed in 
Supplementary Table S2, alongside the measured values. 
We found excellent agreement with the expected value for 
full unfolding (F to U), confirming that the aptamer is 
being completely unfolded. PI unfolding by itself was 
not observed, but the intermediate labelled P2P3 
matched the result expected for unfolding PI and the 
loop-loop interaction (i.e. everything but P2 and P3). 
The last intermediate (labelled P3) was in good agreement 
with the expectation for unfolding everything except P3. 
However, since hairpins P2 and P3 differ by only 2 nt in 
length, additional confirmation for this state was obtained 
using an anti-sense DNA oligomer complementary to the 



5' stem and loop of hairpin P2 to block P2 folding, and 
another complementary to the loop and 3' stem of hairpin 
P3 to block P3 folding (Supplementary Figure SI). 
Notably, whereas state F never reformed once it was 
unfolded in a given FEC, multiple transitions between 
states P2P3, P3 and U were often observed (Figure IB), 
indicating much faster kinetics for hairpins P2 and P3 
than for helix PI and the junction. 

Similar behaviour was observed when measurements 
were repeated in the presence of saturating adenine con- 
centration (200 uM >> K u ~ 600 nM (12)), as shown in 
Figure 1C. In this case, the aptamer was held at i 7 ~0 for 
5 s between FECs to ensure that adenine binding occurred 
(15). The principal change due to adenine binding was an 
increase in the typical unfolding force for the fully-folded 
state, and a concomitant reduction in the population of 
the intermediate states; P2P3 was in particular often not 
observed. The average contour length changes between 
states, measured from >3000 FECs on eight different mol- 
ecules, were found to be identical to the results above 
within experimental uncertainty, indicating the same inter- 
mediates as when adenine is absent (Supplementary 
Table S2). 

To characterize the intermediate states in greater detail, 
folding trajectories were measured at constant force using 
a passive force clamp (26). The aptamer was first fully 
unfolded at high force then the force was reduced in 
discrete steps, measuring the molecular extension while 
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maintaining a constant force after each step for up to 
1 min to observe equilibrium behaviour. The force was 
stepped down until the aptamer was fully folded. In the 
absence of adenine, transitions between five different ex- 
tensions were observed (Figure ID), indicating five 
separate structural elements forming in apparently se- 
quential order. The populations of these states changed 
as the force was reduced, as seen from histograms of the 
extension (Figure IE), with the more-folded states (at 
shorter extensions) becoming more populated at lower 
forces. All five states could, however, be observed coexist- 
ing in equilibrium at F~ 10 pN. 

Analogously to the FEC analysis, the states were 
identified by converting the extension changes between 
them into the number of nucleotides folded, using 
N m = [Ax(F) + An h • d h ]/Ax nt (F), where Ax nt (F) is the ex- 
tension/nt at a given force. The first folding transition 
from the unfolded state, at i 7 ~ll-14pN (Figure ID, 
red), involved 19.0 ± 0.4 nt, precisely the number 
expected for hairpin P3. The second transition (at 
F~10-12pN) involved 21.3 ± 0.4 nt, matching the ex- 
pectation for P2. These are the same partially-folded inter- 
mediates seen in the FECs (P3 and P2P3); once again, the 
identifications were confirmed using an anti-sense oligo to 
block P2 folding (Supplementary Figure SIC). The last 
two transitions displayed very different behaviour. At 
F<9-10pN, the aptamer folded into a long-lived state 
30.3 ± 0.7 nt shorter than P2P3 (Figure ID, yellow). 
Since only 31 nucleotides remain unfolded once P2 and 
P3 have formed, this must be the fully-folded state F. 
Before F formed, however, a rarely- and transiently- 
occupied state ~ 5 nm shorter than P2P3 was typically 
seen (Figure ID, green), at the extension expected when 
the loop-loop interaction has formed to pre-organize the 
binding pocket and only PI is unfolded (8.6 ± 0.7 nt of 
folding measured from P2P3, 8nt expected). We iden- 
tify this as the PI unfolded state (labelled 'P1_U' in 
Figure ID). This state was the only one not observed in 
the FECs. 

When 200 uM adenine was added, the qualitative be- 
haviour of the aptamer was unchanged, with the same 
progression of states seen at similar forces. The principal 
differences involved helix PI: it folded at a slightly higher 
force, and most prominently, its lifetime was greatly 
increased. Indeed, once PI folded in the presence of 
adenine, it was never observed to unfold again (on the 
typical timescale of ~lmin) without a large force 
increase. Adenine thus clearly shifts the equilibrium for 
PI folding, moving the mid-point between folded and 
unfolded PI, F X j Z , to a higher force that is above the equi- 
librium for formation of the kissing loops and possibly 
even hairpins P2 and/or P3. As a result, with adenine 
bound it was not possible to observe equilibrium folding 
of PI under constant force. The results of the constant 
force analysis are summarized in Supplementary Table S3. 

Energy landscape of the aptamer 

Using the measurements above, we reconstructed the 
five-state folding landscape of the aptamer. Because PI 
folding was never in equilibrium in the presence of 



adenine, we first used Jarzynski's equality (33) to deter- 
mine from the FECs the free energy change between F and 
P2P3. The free energy for unfolding PI, the binding 
pocket junction, and the kissing loop complex was 
found to be 10 ± 1 kcal/mol without adenine and 
18 ± 2 kcal/mol with 200 uM adenine. Comparing to the 
free energy predicted for the secondary structure of PI and 
the junction loops, estimated as approximately 
—8.6 kcal/mol (Supplementary Data), our results indicate 
a stabilization energy of ~— 1.5 kcal/mol from tertiary 
interactions that form in the absence of adenine (princi- 
pally the loop-loop complex), and a more substantial 
—8 kcal/mol of additional stabilization upon adenine 
binding. This is consistent with NMR results showing 
some tertiary interactions without adenine but consider- 
able stabilization of tertiary interactions upon adenine 
binding (16,34), with a binding energy of 
approximately —8 kcal/mol as calculated from dissociation 
constants obtained by fluorescence studies (12). 

We next found the height and location of the barrier 
for unfolding PI by analysing the distributions of 
unfolding forces from the FECs. Representative unfolding 
force distributions measured at similar loading rates 
(~40pN/s) with and without adenine (Figure 2 A) show 
a typical increase of several pN in unfolding force upon 
adenine binding. These distributions are well fit by a 
quantitative model based on the shape of the energy 
landscape (35): 
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k 0 ff is the unfolding rate at zero force, Ax* is the distance 
to the transition state from the folded state, AG"'" is the 
height of the energy barrier, r is the loading rate, k B is 
Boltzmann's constant and v is a parameter characterizing 
the shape of the energy barrier (v = 1/2 for a cusp-like 
barrier, v = 2/3 for a softer cubic potential). A comple- 
mentary analysis of the kinetics based on the cumulative 
probability of unfolding yielded the lifetime of the folded 
state as a function of force and hence the unfolding 
rate k{F) (36). Values of k(F) from datasets at different 
loading rates collapse to a single curve (36), well fit by 
the same type of landscape model using Equation (2) 
(Figure 2B). 

Distributions measured at loading rates ranging from 
5-400 pN/s, with and without adenine, were analysed by 
both methods. We averaged the results obtained under the 
two limiting cases for the shape of the energy barrier 
(v = 1/2, v = 2/3), since this shape is unknown. The un- 
folding rate at zero force was found to be 1.5 x 10~ 5 s _1 
(log fe 0 ff = —4.8 ± 0.1) without adenine and 3 x 10 — 7 s — 1 
(log k otY = —6.5 ± 0.4) with adenine bound. The barrier 
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Figure 2. Unfolding force distributions and force-dependent rates 
determined from FECs. (A) The unfolding force is higher with 
adenine bound (red) than without adenine (blue). Solid lines: fits to 
Equation (1). Error bars show SD. (B) Unfolding rate as a function 
of force, fit to Equation (2). Error bars show SEM. The barrier to 
unfolding is located at the first base triple between PI and the 
junction loop (inset, red). 



height was 12.3 ± 0.4kcal/mol without adenine, and 
16 ± 2kcal/mol with adenine bound. The barrier 
position did not depend on adenine binding: 
A.r 1 = 6.2 ± 0.6nm with adenine and 6.1 ± 0.5nm 
without. Since ssRNA has an extension of 
0.40-0.45 nm/nt in the force range 12-20 pN matching 
the most probable unfolding forces, this result places the 
transition state for unfolding at 7-8 bp along helix PI. A 
comparison to the crystal structure of the ligand-bound 
aptamer (7) reveals that the transition state is located next 
to the first base pair in PI involved in triplex interactions 
with the junction loops: nucleotides U20:A76 in PI and 
U49 in the loop, shown in red in Figure 2B (inset). These 
interactions therefore appear to act as structural keystones 
preventing unfolding. 

Finally, these FEC results for PI unfolding were 
combined with the constant force results for all the 
other transitions to reconstruct piecewise the energy land- 
scape for the native folding pathway of the aptamer at a 
constant force of F = 10.8 pN, with and without the 
binding of an adenine molecule (Figure 3). The transitions 
other than PI unfolding were analysed as sequential 
two-state processes. The relative positions and energies 
of the states were determined from constant-force exten- 
sion histograms, whereas the positions and heights of the 
energy barriers between states were determined from the 
force-dependent kinetics (15,37). 




Figure 3. Folding energy landscapes. The key features of the energy 
landscapes for the five-state native folding pathway were reconstructed 
from piecewise two-state analyses of each transition. Energies and pos- 
itions are plotted with reference to the P2P3 state. Error bars show 
SEM. Dotted lines indicate notional landscape shapes in the presence 
(red) and absence (black) of adenine. Cartoons indicate the structural 
features associated with each state. 



Misfolded states in the aptamer 

Intriguingly, an additional set of states with different ex- 
tensions and distinct features (Figure 4A) was observed 
occasionally at medium forces (~10pN). Most prominent 
was a long-lived state labelled 'M' in Figure 4A. This state 
was observed for all molecules, independent of adenine 
concentration (Supplementary Figure S2A), occurring at 
a rate of ~1 min~ with a lifetime of ~ 110 s. Within state 
M, additional transient folding events were also seen, such 
as the spike labelled 'M3' in Figure 4A, with lifetimes of 
~10ms. The extension change from U to M, determined 
from extension histograms (Supplementary Figure S2B), 
corresponds to 31 ± 1 nt of folded RNA. This does not 
match the length of any structural feature in the regular 
folding pathway of the aptamer. As the aptamer almost 
always (~95% of the time) entered and exited this state via 
the fully unfolded state, rather than any of the partially 
folded intermediates, we therefore attribute it to an 
off-pathway, 'misfolded' structure. 

A search of suboptimal structures using mfold (38) and 
HotKnots (39) revealed two possible candidates for state 
M: a potential pseudoknot involving 29 nts that would 
normally form hairpin P2 and the 5'-end of PI, or a 
hairpin made from 32 nts which would normally form 
the 3' stem of P2 through to the 5' stem of P3 
(Figure 4B). By using anti-sense oligos to block in turn 
the 5' stem of PI and the 8-nt junction loop between P2 
and P3 (J2/3), we found that M still forms when PI is 
blocked (Figure 4C) but not when J2/3 is blocked, 
indicating that M is the hairpin, not the pseudoknot. 
Evidence for at least two additional, transient folding 
transitions out of state M was found from the extension 
histograms (Figure 4D): one to a state that is 2.6 ± 0.5 nm 
shorter than M (labelled 'M2') and another to a state that 
is 7.1 ± 0.5 nm shorter ('M3'). These distances are consist- 
ent with two of the extra misfolded states that may form 
along in concert with M (inset, Figure 4D): an 11 -nt 
hairpin causing an extension decrease of 2.5 nm (M2: nu- 
cleotides 24-34) and a 16-nt helix-bulge decreasing the 
extension by 6.5 nm (M3: nucleotides 30-36 and 69-77). 

The misfolded state clearly prevents the adenine binding 
site from forming, which would negate the regulatory 
ability of the riboswitch. However, we do not expect 
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Figure 4. Misfolded states. (A) Occasionally a state with extension between P3 and P2P3 (labelled 'M') forms from the unfolded state. It does not 
lead to any of the other states on the pathway to the natively folded state F — it begins and ends at U — but it does lead occasionally to other 
misfolded states with different extensions, such as the state M3. (B) Two potential misfolded structures include a pseudoknot (nucleotides 17-45) and 
a hairpin (37-68). (C) With a DNA oligomer bound to the 5' strand of PI, the misfolded state still occurs, indicating that misfolding does not involve 
the pseudoknot. The P1_U state is still observed even when PI is prevented from forming. (D) Extension histograms of the off-pathway, misfolded 
states reveal three distinct misfolded states which are well fit by Gaussian distributions. Insets: structures associated with each state. (E) Folding 
pathways of the aptamer. The native pathway involves sequential formation of hairpins P3 and P2, then the P2-P3 kissing loop complex (P1_U) and 
finally the fully-folded aptamer. 0.3% of the time the unfolded state folds into state M, preventing formation of the adenine binding pocket and 
leading to other off-pathway, misfolded states. 



state M to be biologically relevant: its folding rate is much 
slower than that of P2 and P3, and it forms only very 
rarely in our in-vitro assay (only ~0.3% of the folding 
transitions initiated from U go to M). It would likely 
form even more rarely co-transcriptionally, because P2 
would be expected to fold first. Numerous RNAs are 
known to misfold into non-native structures simply due 
to the rugged energy landscape generated by alternative 
patterns of basepairing (40), although few have been 
studied with single-molecule methods allowing individual 
trajectories to be followed (41,42). Combining all the 
results above, we can build an integrated picture of 
the alternative pathways, each with multiple states 
(Figure 4E). 

Full-length riboswitch 

In order to probe the interaction between the aptamer and 
the expression platform, we also made constructs 



containing the full riboswitch sequence, consisting of the 
aptamer plus the expression platform. FECs were 
measured by pulling the traps apart until the riboswitch 
unfolded, then rapidly bringing the traps together, waiting 
5 s for folding and ligand binding, and repeating. FECs 
measured in the presence of 200 uM adenine (Figure 5A) 
look very similar to FECs of the adenine-bound aptamer 
alone: the riboswitch unfolds between two principal states 
at relatively high force, with an unfolding force distribu- 
tion (Figure 5C) that matches the result found for the 
adenine-bound aptamer (Figure 2A, red). The contour 
length change between the two states, determined from 
WLC fits to 990 FECs from two molecules, was 
AL C = 40.7 ± 0.5 nm. This is very close to AL C for 
complete unfolding of the aptamer alone (39.7 nm), 
indicating that the aptamer is almost always folded, the 
expression platform is unstructured, and the riboswitch is 
in the 'on' state for gene expression (Figure 5A, inset). 
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Figure 5. FECs of the full-length riboswitch. (A) With adenine bound, the aptamer is folded ('on' state) and the contour length change is the same as 
for the aptamer alone. Solid lines: fits to 'on' (grey) and unfolded (green) states. Inset: secondary structure in the 'on' state. (B) Without adenine, the 
expression platform is folded (red, blue, brown) more often than the aptamer (black). Solid lines indicate fits to 'on' (grey), 'off (orange) and 
unfolded (green) states. Dotted lines indicate partially-folded intermediates of the aptamer alone: P1_U (blue), P2P3 (pink) and P3 (cyan). Inset: 
secondary structure in the 'off state. (C) The unfolding force distribution of FECs measured with adenine present matches that expected for the 
adenine-bound aptamer. Red line: fit to Equation (1). (D) The unfolding force distribution without adenine reveals two sub-populations. The 
higher-force peak (black) comes from curves initially in the 'on' state and matches the distribution expected for the adenine-free aptamer [red 
line: fit to Equation (1)]. The lower peak (orange) comes from curves initially in the 'off state and matches the distribution expected for the 
expression platform alone [brown line: fit to Equation (1)]. (E) Unfolding force distribution of the expression platform alone [brown line: fit to 
Equation (1)]. The most likely force is lower than for the 'off state in (D) due to a 15-fold lower loading rate. 



When no adenine is present, the signature of aptamer 
unfolding is still observed some of the time (Figure 5B, 
black), but a majority of the FECs display qualitatively 
different behaviour (Figure 5B: red, blue and brown). 
These two sub-populations of unfolding behaviour are re- 
flected very clearly in the unfolding force distribution, 
which reveals two distinct peaks for the force of the first 
unfolding event in each curve (Figure 5D). The 
higher-force peak (Figure 5D, black) results from the 
curves where the contour length change indicates 
the aptamer is initially folded and the riboswitch is 'on' 
(Figure 5B, black). Fits to this part of the distribution 
using Equation (1) (Figure 5D, red) return the parameters 
expected for the adenine-free aptamer (compare to 
Figure 2A, blue), confirming the identification of the 
state as 'on'. The riboswitch was observed to be 'on' 
~35% of the time, even though no adenine was present. 
This 'leakage' rate under conditions that should turn the 
expression off is within the typical range of rates for 
similar regulatory elements such as transcription termin- 
ators (43,44). 

The lower-force peak is due to the other, majority class 
of FECs (Figure 5B: red, blue and brown), which lack the 
single prominent sawtooth feature seen for the aptamer 



alone and display instead a larger number of smaller, 
repeated unfolding/refolding transitions. In these curves, 
the first unfolding event occurs at considerably lower force 
(~5— 8 pN) than for the 'on' state, as seen from the unfold- 
ing force distribution (Figure 5D, orange). The contour 
length of these curves also starts off shorter than that 
of the 'on' state: fitting the low-force region of these 
FECs to a WLC model (Figure 5B, orange) reveals 
AL C = 52.2 ± 0.9 nm to the fully-unfolded state 
(Figure 5B, green), measured from 1989 FECs on 4 mol- 
ecules. This agrees well with the value of 53.4 nm expected 
for unfolding the full riboswitch in the 'off state, 
assuming that the expression platform is folded as well 
as P2, P3, and the loop-loop complex (Figure 5B, inset). 
The low unfolding force observed is also consistent with 
unfolding the expression platform, since its three bulges 
and low G:C content reduce its mechanical stability 
(24,37). Indeed, the low-force peak of the distribution in 
Figure 5D matches the unfolding force distribution 
measured using constructs that contain the expression 
platform alone, shown in Figure 5E (1400 FECs on 2 mol- 
ecules). With no adenine present, we observed the 
riboswitch to be in the 'off state ~65% of the time. 
Interestingly, the riboswitch is in the 'on' state ~2.5 
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Figure 6. Switching and ligand binding. A riboswitch, initially 'off in 
the presence of adenine, switches several times between 'off and 'on', 
eventually binding adenine and remaining in the thermodynamically 
more stable adenine-bound 'on' state before unfolding completely. 



times more frequently when adenine is present than when 
it is not, matching well the ~3-fold increase in expression 
levels observed in vitro (9). 

In the FECs measured without adenine, the riboswitch 
is seen to fluctuate between six different states: the five 
states of the aptamer alone and the 'off state with expres- 
sion platform folded. Indeed, what looks like noise in the 
FECs between the 'off state and the unfolded state is 
consistent with frequent inter-conversion between the dif- 
ferent partially-folded states of the aptamer, shown in 
Figure 5B as dotted lines. However, the unfolding of the 
full riboswitch from the 'off state is not simply sequential, 
as for the aptamer alone, even though the unfolding force 
for the expression platform is several pN lower than that 
of any other structural element of the riboswitch. Instead, 
the presence of two alternative, fully-folded states is 
observed directly, through switching between the 'on' 
and 'off states. 

Examples of this behaviour are seen in Figure 5B 
(brown curve at ~6 pN, blue curve at ~8-9 pN). It is 
even seen very occasionally with adenine present, as in 
Figure 6. Here, the riboswitch is initially folded in the 
'off state (without adenine bound), but at F ~ 5 pN the 
contour length twice increases briefly to the 'on' state 
value before returning to the 'off state, i.e. the folded 
expression platform is transiently replaced by a folded 
PI. A last switching event is seen at ~7pN, when the 
'off state unfolds to the contour length expected for the 
Pl-unfolded state (i.e. with repressor hairpin completely 
unfolded) before PI folds to produce the 'on' state one last 
time. Adenine clearly binds to the aptamer soon there- 
after, since the 'on' state is stabilized and does not 
unfold until a high-force characteristic of the 
adenine-bound state. 



DISCUSSION 

These results provide new insight into the relation between 
folding and functional mechanisms in the add riboswitch, 



both at the level of the aptamer alone as well as the 
full-length riboswitch, especially through comparison to 
previous results for the pbuE riboswitch (15). While the 
two riboswitches have aptamers with very similar struc- 
tures, they operate by quite different mechanisms: trans- 
lation activation for add as opposed to transcription 
anti-termination for pbuE. Comparison of the behaviour 
of the aptamers reveals many commonalities due to the 
similar structures, but also some important differences. In 
both cases, a total of five states were observed, and the 
aptamers were found to fold sequentially: first the two 
hairpins (P2 and P3) forms, then the loop-loop interaction 
and finally PI. Crucially, the helix PI — which acts as the 
mechanical switch — is the last part of the molecule to fold. 
In both cases, this helix is strongly stabilized by adenine 
binding, whereas the energetic and kinetic properties of 
the other states are not affected (Figure 3). The recon- 
structed energy landscapes are in fact quantitatively very 
similar, due to the extensive structural and sequence 
similarities between the two aptamers. 

The landscapes in Figure 3 also highlight some of the 
important differences between the aptamers. First, the 
order of folding of the two hairpin loops is reversed: P3 
folds at higher force than P2 in the add aptamer, vice versa 
for the pbuE aptamer. This effect was predicted by simu- 
lations (45) and is due to the different relative stabilities of 
the helices P2 and P3 in the two different aptamers. More 
fundamental is the difference in stability of helix PI. In the 
add aptamer, PI is similar in stability to P2 and P3 even in 
the absence of adenine (F X j 2 ~ 10 pN) and it has a low 
unfolding rate at F = 0 (/r Q ff ~ 10~ 5 s -1 ). In contrast, PI in 
the pbuE aptamer is quite unstable without adenine bound 
(F\j 2 = 3 pN) and has an unloaded unfolding rate k o(i that 
is 10000 times higher. As a result, the pbuE aptamer is 
much more susceptible to strand invasion by the expres- 
sion platform, the process which is required to form the 
terminator hairpin and switch the gene off. 

The different stabilities of PI appear to be related 
directly to the different functional mechanisms of the 
riboswitches. For the pbuE riboswitch, the conformation 
with terminator hairpin folded and aptamer unfolded is 
the most stable thermodynamically, regardless of whether 
or not adenine is bound to the aptamer; both in vitro 
(13,15,19) and in vivo (9) measurements point to a mech- 
anism whereby the outcome of riboswitch folding (and 
hence level of gene expression) is governed by a competi- 
tion between the kinetics of aptamer folding, ligand 
binding and transcription elongation. In such a situation, 
an unstable PI is what is needed to permit strand invasion 
on a timescale competitive with transcription rates. 
Kinetic control of riboswitch folding has also been 
demonstrated for the FMN riboswitch (20). Our measure- 
ments on the full-length add riboswitch, in contrast, show 
quite clearly that the thermodynamically stable conform- 
ation switches upon adenine binding: the state with the 
aptamer folded and expression platform unfolded ('on') 
is dominant, and hence most stable, when adenine is 
bound, but the state with the expression platform folded 
and aptamer partially-unfolded is dominant when adenine 
is not bound. Results such as those in Figure 6 encapsulate 
the essence of the folding of this riboswitch: the RNA 
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Figure 7. Folding kinetics and riboswitch mechanism. (A) The aptamer refolding rate as a function of force without adenine is determined from the 
distribution of refolding forces when ramping the force down from denaturing values. Error bars show SEM. Red line: fit to Equation (2). (B) The 
fraction of unfolding FECs in the off state as a function of refolding time at low force in the absence of adenine shows an exponential rise as 
the riboswitch structure equilibrates into the more stable 'off state. (C) Schematic of the riboswitch mechanism. The aptamer folds rapidly before the 
expression platform is transcribed, regardless of adenine binding. If adenine binds to the aptamer. it stabilizes the 'on' state of the riboswitch 
(aptamer folded, ribosome binding site exposed). Without adenine binding, the 'on' state is unstable and equilibrates into the 'off state (PI unfolded, 
ribosome binding site sequestered). RBS: ribosome binding site. 



inter-converts from 'off to 'on', with the outcome ultim- 
ately determined by thermodynamic changes due to ligand 
binding. 

It was previously suggested, based on the 
adenine-binding ability of the full-length riboswitch as 
well as mfold (38) predictions of secondary structure 
energies for the 'on' and 'off states, that the 'off state is 
more stable than the 'on' state without adenine but still 
readily fluctuates into the 'on' state in an equilibrium 
(9,12). Our observations directly show that this is, in 
fact, the case. Given that the 'off state is occupied ~1.7 
times more often than the 'on' state in the absence of 
adenine, the 'off state is more stable by ~0.3kcal/mol 
without adenine bound. This agrees within error with 
our estimate of the energy difference between the 'on' 
and 'off states obtained by adding to secondary-structure 
energy calculations the energies we measured for tertiary 
interactions and ligand binding: the 'off state should be 
more stable than the 'on' state by ~1.4kcal/mol when 
no adenine is bound, whereas the 'on' state should be 
more stable by ~6.6kcal/mol with adenine bound 
(Supplementary Data). 

In order to demonstrate that the regulatory mechanism 
of this riboswitch is controlled thermodynamically, we 
also measured the kinetics of the folding. We first 
determined the folding rate of the aptamer (the 'on' 
state) at F = 0 from refolding FECs measured in the 
absence of adenine while ramping the force from 
~20pN (where the aptamer is completely unfolded) 



down to OpN. A total of 1227 refolding FECs were 
measured, and the cumulative folding probabilities were 
analysed (36) as in Figure 2B. The average refolding times 
plotted as a function of force (Figure 7A) were then 
extrapolated to zero force by fitting to Equation (2), re- 
sulting in an unloaded folding rate of ~30s -1 . Since tran- 
scription in vivo occurs at a typical rate of ~40-50nt/s 
(46,47) and the expression platform contains 40 nt, this 
folding rate implies that the aptamer should fold com- 
pletely well before the repressor hairpin has even been 
partially transcribed, regardless of adenine binding. 
Hence the riboswitch always starts in the 'on' state. 

Since adenine binding locks the riboswitch into the 'on' 
state with a large binding energy and low unfolding rate, 
then if the riboswitch is to act as a switch, clearly the 'on' 
state must be able to equilibrate into the 'off state rapidly 
in the absence of adenine, before the ribosome binds the 
Shine-Dalgarno sequence. To determine this equilibration 
rate, 5012 FECs were measured on 11 molecules while 
varying the time spent waiting at low force to permit 
equilibration, from 0.3-5 s. An exponential fit to the 
fraction of molecules found in the 'off state as a 
function of time (Figure 7B) yielded an equilibration 
time of x eq = 0.2 ± 0.1 s. The ribosome binding time is 
not well known, but a lower bound (even for coupled 
transcription and translation) is provided by the time 
required to transcribe the mRNA footprint for the 
ribosome. Footprinting studies and crystal structures 
(48,49) imply this involves another ~15nt downstream 
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of the 3'-end of the riboswitch expression platform, 
requiring ~0.3 s at typical transcription rates (or 1.5 x eq ). 
Hence the 'on' and 'off states should be substantially (if 
not completely) equilibrated by the time the ribosome 
binds, strongly supporting a thermodynamically- 
controlled regulatory mechanism. 

The picture of the riboswitch mechanism suggested by 
our results is summarized in Figure 7C. During the tran- 
scription of the expression platform, the riboswitch folds 
into the 'on' state. If adenine does not bind to the aptamer 
after the expression platform is fully transcribed, the 'on' 
state is no longer thermodynamically stable and the 
riboswitch equilibrates rapidly into the lower-energy 'off 
state, repressing gene expression but still allowing the 'on' 
state to be populated at an appreciable level. If adenine 
does bind to the 'on' state, then it becomes greatly 
favoured thermodynamically and the riboswitch stays 
'on'. 
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